R«c'd PCT/PTO 0 8 MAR 2005 
10/527107 

WO 2004/025840 PCT/IB2003/003870 

1 

Mefhod ^nd device for source decoding a variable-lengfli soft-input codewords sequence 



FIELD OF THE INVENTION 

The present invention relates to a method for source decoding a variable- 
length soft-input codewords sequence into a soft-output bit sequence. 

Such a method may be used in any system using variable-length codes like, for 
example, a video or audio communication system. 

BACKGROUND OF THE INVENTION 

A video commmiication system typically comprises a source encoding system, 

a channel and a source decoding system. The source encoding system generates variable- 
length codewords sequences and transmits them over the channel to the source decoding 
system that decodes them thanks to a shared code. 

Variable-length codes, which are widely used in video coding standards for 
then: compression capabilities are very sensitive to channel errors. As a matter of fact, when 
some bits are altered by the channel, synchronisation losses can occur at the receiver side, 
possibly leading to dramatic symbol error rates. This phenomenon has led to introduce 
modified variable- length codes such as e.g. self-synchronising Hufi&nan codes, reversible 
variable-length codes. 

Another solution is to re-introduce redundancy in the bitstream by inserting an 
error correcting code in the chain. The key point of the latter solution is to appropriately use 
the residual source redimdancy at the decoding side. Being considered as a form of implicit 
channel protection by the decoder, this redimdancy can be exploited as such to provide error 
correction capability of the variable length coded source. 

Recent work showed that low-complexity approximate MAP (Maximimi A 
Posteriori) decoders could be used, that provide approximately the same performance as all 
existing soft decoding algorithms while exhibiting a complexity close to the hard decoding 
case. Such a decoder is disclosed in [L. Perros-Meilhac and C. Lamy. "Huffinann tree based 
metric derivation for a low-complexity sequential soft VLC decoding". In Proceedings of 
ICC'02, volume 2, pages 783-787, New York, USA, April-May 2002]. 

Even though MAP algorithms exist and proAdde very good results in temis of 
error correction, they are very complex. On the other hand, stack algorithms, as the one 
disclosed in [Buttigieg : "Variable-length error-correcting codes" PhD thesis. University of 
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Manchester, United Kingdom, 1995] are much less complex and can reach singular 
performance. However, they are primarily symbol level decoding algorithms and thus are not 
well adapted to provide reliability information on bits. 

The aim of the invention is to provide a method for soiirce decoding a 
variable-length soft-input codewords sequence working at a bit level and at low complexity 
for VLC codes. 

SUMMARY OF THE INVENTION 

The subject matter of the invention is a method for source decoding a variable- 
length soft-input codewords sequence as defined in claim 1 . 

In addition, there is provided a source decoder for decoding a variable-length 
soft-input codewords sequence as defined in claim 10. 

Additional features are disclosed in the other claims. 
As we will see in detail fiarther on, such a method has the advantage to provide a reliability 
information (or soft-output) on the decoded sequences, allowing to select paths in a tree in an 
increasing order of their metric value. Thus, by searching for only useftil codewords, the 
proposed method is very efficient in term of CPU cost, complexity and time, as many other 
paths in the decoding tree are not considered anymore. The soft-output in formation allows 
then to perform iterative joint decoding when the VLC encoder has been concatenated with 
another. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will be better imderstood from reading the foUovdng description 
which is given solely by way of example and in which reference is made to the drawings, in 
which : 

- figure 1 is a general schematical view of a transmission chain, comprising an 
encoder and a decoder according to the invention; 

- figure 2 is a VLC codewords table associated with the variable-length 
codewords sequences used in the encoder and the decoder of figure 1 ; 

- figure 3 is a detail view of a joint decoder of figure 1 including a decoder 
according to the invention ; 

- figure 4 is a flow chart of the soft-input soft-output variable-length 
codewords sequence decoding method according to the invention ; 
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- figure 5 is a representation of a tree associated with the VLC table of figure 

2; 

- figure 6y 7 and 8 are representations of concatenated code trees associated 
with the VLC table of figure 2 created during implementation of the decoding method 

5 according to the invention; and 

- figures 9 and 10 are graphs showing performances of the method according 
to the invention compared to other method. 

DETAILED DESCRIPTION OF THE INVENTION 

A transmission chain for joint decoding between convolutional code and 

.0 variable-length code is shown on figure 1 . 

This transmission chain comprises a transmitter 1 and a receiver 2. 

Variable-lengths codes (VLC) are used in die transmission chain to reduce the 

length of the transmitted bitstreams. 

The transmitter 1 includes a variable length source 10 which is adapted to 

15 output a random VLC symbols sequence s according to chosen VLC codewords probabilities. 

At the output of the VLC source, the transmitter 1 comprises a variable length 

encoder 12, a pseudo-random interleaver 14, a systematic convolution (CC) encoder 16, a 

puncturer 18 and a BPSK modulator 20. 

The random VLC symbols sequence s is encoded in the VLC encoder 12 

20 which is a adapted to map the received symbols, for example grouped into packets of R 

symbols, to a T-bit VLC sequence denoted x [1:T]. The bit sequence x [1:T] is created, as 

known per se, following the rule edicted by a VLC table. An example of VLC table is given 

on figure 2. 

In the VLC table, the code C is defined as follows. A codeword is associated 
25 with a symbol Si, a codeword having a predetermined length. A value of a codeword 

represents as well the value that can be taken by the associated symbol Si. Besides, to each 
codeword, a probability of appearance P (Sk) is associated. 

Each sequence x [1 :T] is then permuted by the interleaver 14 to obtain an 
interleaved sequence x [1 : T]. 
30 This interleaved sequence 3c [1 : T] is given as input to the systematic 

convolutional encoder 16. The coded sequence denoted v [1 : T x n/k] at the output of the CC 
encoder 16 is eventually punctured by the puncturer 18 in order to attain the wished 
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transmission rate, modulated by the BPSK modulator 20 and transmitted over a channel with 
variance o^. 

The receiver 2 includes a demodulator 30, an eventual depuncturer 32 and an 
iterative decoder or joint decoder 34. 

5 The depuncturer 32 is adapted to introduce into the received sequence, a zero- 

value samples to replace the samples removed from the coded sequence by the puncturing 
operation. The depunctured sequence denoted z [1 : T x n/k] is then decoded by the iterative 
decoder 34, the flowchart of which is detailed on figure 4, 

As shown on figure 3, the iterative decoder 34 comprises a soft-input soft- 

10 output (SISO) CC decoder 42 and a soft-input soft-output (SISO) VLC decoder 44 according 
to the invention. The reliability information provided by the CC (resp. VLC) decoder is 
interleaved in 46 (resp 48) and is used to improve the decoding of the VLC (resp. CC) 
decoder. 

The SISO CC decoder 42 is for example as disclosed in [L. Bahl, J. Cocke, F. 
15 Jelinek, and J. Raviv. "Optimal decoding of linear codes for minimizing symbol error rate". 
IEEE Transactions on Information Theory, 20:284-287, March 1974]. 

For decoding a sequence y [1 : T], and as known per se, the SISO VLC 
decoder 44 is adapted to compute a soft-output in the form of the log a posteriori ratio 
A(x[t]) : 

and a hard bit estimate sequence denoted jc,, [1 : T] or a hard symbol estimate sequence 
denoted Sy [1 : R]. The hard symbol estimate sequence is derived from the hard bit estimate 
sequence by applying the VLC table. 

The VLC SISO decoder 44 will derive the reliabiUty information on bits A 
25 (x[t]) by taking into account the a priori knowledge it had on the VLC encoder. In this case, 
the a priori knowledge will consist in a VLC tree structure derived from the VLC table, the 
occurrence probabilities of the symbols P (Sk) and any other source side information denoted 
SSI, such as possibly the number of symbols of the concerned sequence. 

A summary of the iterative decoding method is given hereafter. 
30 1. Initialise O%] = 0. 

2. For iterations r = 1, 2, I, where I is the total number of iterations 



wo 2004/025840 PCT/IB2003/003870 

5 

• compute Ac^ [1 • Tl applying a SISO CC decoder xising the depunctured 
received sequence z[l : T x n/k] as observation and O (r 1) [1 : T] as a priori probabilities 
ratio, 

compute £^c'^ [t] = /^S^ [t] - E^f'^ [t] for 1 ^ t ^ T, 

_2 



observation. 



compute y^'>[t] = ^ E^c^ W for 1 ^ t ^ T, 



compute [t] applying a SISO VLC decoder 44 using y^'^ [1 : T] as 



compute £^r> [t] = Alr> [t] - E^c^ W for 1 ^ t ^ T, and 
compute 0«[t] = [t] for 1 ^ t ^ T. 
.0 More precisely, the iterative decoding process takes place as follows : At the 

r* iteration, the CC decoder's input consists in the depunctured sequence z[l : T x n/k] and 
the a priori probabilities ratio O ^^'^^ [1 : T] of the interleaved sequence denoted x [1 :T] 
obtained at previous iteration. The CC decoder 42 provides the Ac^ [1 • T] ou^ut sequence. 

At this same r* iteration, tiie VLC decoder 44 takes as input the observation 
1 5 sequence y^^^ [1 : T] derived of the CC decoder ou^ut sequence, the a priori probabilities of 
VLC symbols as well as any other available source side information SSI and provides the 
Ac^ [1 : T] output sequence. 

To have each decoder taking advantage of the iterative process, independent 
information must be exchanged between the two decoders, that is the so-called extrinsic 
20 information, [t] and Ev^ [t] are defined as the extrinsic information about the bit t 
provided respectively by the CC decoder 42 and by the VLC decoder 44. 
forr ^1, 

E^'m = A^^>[t]-^r^ut], 
^^.'■^[t] = A<;>[t]-£^^>[t], 

25 where E^y^ [t] is set equal to zero. 

The CC extrinsic information Ec^ [1 • T] sequence scaled by a^/2 is used as 

2 

observation for the r* iteration of VLC decoder, thus y^'^[t] = £!^S^ [t]. 

The interleaved VLC extrinsic information Ev^ [1 : T] is used as a priori 
probabilities ratio estimate for the r + 1^ iteration of the CC decoder 42, 
30 = Ey^[l:T]. 
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The SISO VLC decoder 44 implements a soft-input soft-output (SISO) stack 
method, the algorithm of which is disclosed on figure 4. 

The method proceeds in two main stages : 

- a hard decoding stage 100 for estimating an hard transmitted VLC sequence ; 

5 and 

- a post processing stage 102 for deriving the soft values. 

The first main stage 100 of the method consists in applying a stack sequential 
decoding algorithm on a "huge*' tree formed by concatenating several times a considered 
Huffman tree until for example the number of bits and the number of symbols in the 
10 considered sequence are reached. 

The first step 111 of the hard decoding stage 100 consists hi creating a 
structure tree by defining relationships between nodes and computing an a priori probability 
associated with each branch of the tree. The imitary Hufftnan tree corresponding to VLC 
codes of figure 2 is shown on figure 5. 
15 Such a tree comprises a plurality of : 

- nodes N, a plxirality of nodes corresponding to a possible codeword or a 

symbol Si ; 

- branches B, a metric M being associated with each branch B. A branch B is 
composed of two nodes N, in other words, from a node N, one or two branches can be 

20 created, a "left" branch and a "right" branch ; a branch has an associated bit value 0 or 1 ; 

- paths, a path representing a decoded bit sequence. 

A path comprises a plurality of branches B and goes firom an initial node NOO 
to a succeeding node which may be a symbol Si. 

Besides, a tree has different levels, the first one being the level 0, 
25 At step 1 12, a stack is defined. This stack is memorized at each step of the 

hard decoding stage 100 in order to be later used durmg the post processing stage 102. 
The stack contains : 

- a matrix containing selected "paths" i.e. the sequence of node indexes 
forming these paths, 

30 - a vector of integers storing for each path ui the stack the number of decoded 

bits up to the current time, 

- a vector of integers storing for each path in the stack the number of decoded 
symbols up to the current time, and 
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- a vector of float storing for each path in the stack the associated cumulative 

metric. 

The stack is initialised by placing the initial node NOO with metric 0 in the 
stack. Initial node NOO is considered as being the top path in the decoding stack. 
5 At step 1 13, a metric of the succeeding branches of the last node of the top 

path is computed. 

As sequential decoding methods compare sequences of different lengths, a 
specific metric is used. For each node 1 in the set Np, the metric associated to the branch 
leading to this node at time t is defined as follows : 
0 m(l, y[t]) = - log P(y[t] I v(l)) - log pt(l) + log Po(y[t]). 

• Np : the set of nodes having a predecessor, 

• pt(l) (1 € Np): the a priori probability of the branch reaching the node 1 at 

timet; 

• v(l) (1 e Np): the value of the branch reaching the node 1, v(l) e {0,1}; 
5 • Po(y [t]) : a Fano - Massey metric which allows to compare fairly 

sequences of different lengths. 

The term pt (1) will in practice be approximated by the a priori probability of 
the branch p(l) for simplicity reason. This last quantity can be directly obtained firom the tree 
representation of the YLC table and the codeword probabilities which are assumed to be 
20 known by the decoder as explained in \L. Guivarch, J.-C. Carlach, and P. Siohan. "Joint 
source-channel soft decoding of Huffinan codes with turbo-codes". In Proceedings of the 
Data Compression Conference (DCC'OO), pages 83-91, Snowbird, Utah, USA, March 2000]. 

At step 1 14, a test is performed to determine whether an extended path reaches 
a symbol node. An extended path consists in the current path concatenated with a possible 
25 succeeding branch. 

If such a symbol node is reached by an extended path, the number of symbols 
associated to this path is increased at step 115. 

The increasing of the number of symbols consists in concatenating the 
considered unitary Huffinann tree corresponding to the VLC codes with the initial node at the 
30 symbol node reached by the extended path. 

The top path is deleted from the stack at step 116. The top path is tiie path 
mentioned in the first line of the stack. It is also the path of the stack having the smallest 
cumulative metric. 

The extended paths are then inserted in the stack at step 1 17. 
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The cumulative metric of each new top path is computed and stored in the 
stack. The cumulative metric is equal to the cumulative metric of the previous top path 
increased with the metric of the branch added to obtain the extended path. 

At step 1 1 8, a new top path is selected. The new top path selected is the path 
of the stack having the smallest cumulative metric among the paths listed in the stack. 

Next, it is checked if stop conditions are verified at step 121. The stop 
conditions are for example that the top path contains the number of bits and the number of 
symbols of the original sequence. 

If stop conditions are verified at step 121, then step 122 is carried out. 
Otherwise, step 1 14 and foUowings are repeated. 

During the hard decoding stage 100, the content of the stack is stored at each 

step. 

At step 122 corresponding to post processing stage 102, soft-output values are 
derived from the cumulative metrics of paths considered in hard decoding stage 100 and 
stored in the stack. Post processing stage algorithm will be disclosed later. 

An illustrated example of hard decoding stage 100 is given hereafter. 

Step 111 : creation of the structure tree by defining relationships between 
nodes computing the a priori probability associated with each branch. 

Step 112 : initialization of the stack 



Cumulative metric 
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NOO 



PTH 



PTHOO 



Nb of bits 



Nb of symbols 



0 



10 Ste p 113 : computation of the metric of the succeeding branches of last node 

NOO. 

They are calculated as follows : 
M(NOO.N10) = M10 
M(N00-N11) = M1L 

15 Step 1 14 : The extended path reaching node NIO reaches a symbol node. Thus, 

step 115 is implemented for node NIO. 

Step 115 : concatenation of the considered imitary Hufinann tree with the 
initial node at the reached symbol node NIO. The resulting tree is shown on figure 6. 

Steps 116& 117 : deletion of the top path PTHOO from the stack and insertion 
30 of the extended paths in the stack 
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Step 118 : selection of the new top path i,e. the path having the smallest 
cumulative metric. 

(We assume that the smallest cumulative metric is Mil) 
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[21: stop condition is assumed to be not verifiec 





Step 113 : the last node of the top path is Nl 1 

The metric of the succeeding branches of last node of the top path are 
calculated as follows : 

M(N11-N20) = M20 
M(N11-N21) = M2L 

Step 1 14 : the extended paths reaching nodes N20 and N21 reach a symbol 
node, thus step 1 15 is implemented for nodes N20 and N21. 

Step 115 : concatenation of the considered unitary Huffinan tree with the initial 
node at the reached symbol node N20 and N21 . The resulting tree is shown on figure 7. 

Steps 116 & 117 : 
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Step 118 : selection of the new top path, i.e. : the path having the smallest 
cmnulative metric. 

(We assume that the smallest cumulative metric is Ml 1 + M21). 
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121 : stop condition assumed not be verified. 
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Step 113 : the last node of the top path is N21 = NOO. The metric of the 
succeeding branches of last node of the top path are calculated as follows : 

M(N21-N34) = M'10 
M(N21-N35) = M'll. 

Step 1 14 : the extended path reaching node N34 reaches a symbol node, thus 
step 1 15 is implemented for N34. 

Step 115 : concatenation of the considered unitary Huffinan tree with the initial 
node NOO at the reached node N34 as shown on figure 8. 

Steps 116 & 117 : deletion of the top path PTH21 firom the stack and insertion 
of the extended paths. 
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Step 118 ; selection of the new top path i.e., the path having the smallest 
cumulative metric. 

(We assume that the smallest cumxilative metric is Ml 1 + M20). 
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The steps as above disclosed are repeated until the stop conditions are verified 

15 at step 121. 

Once the sequential decoding process is finished, the post-processing takes 
place and generates soft-outputs. These soft-outputs must consequently be extracted from the 
paths that have been examined and stored by the stack decoding algorithm. 
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Let {Pi,. . .,Pr} be the r examined paths stored in the stack. A given path Pi (I ^ 
i ^ r) is characterised by a length in bit T Pi, a sequence of bits {jc^, [l],..-,jcp^ [TpJ} anda 
cumulative metric fip^. 

A first solution proposes to approximate the log-likelihood ratio A (x [t]) by: 
5 A(x[t])-;u(t,0)-//(t,l), 

where // (t, 1) (resp. // (t, 0)) is the minimum cumulative metric for all the 
paths in the stack for which the t* estimated bit is 1 (resp. 0). 

If P* is the path selected by the decoding process, then if xP * [t] = i (i = {0, 

1}), we have 

0 //(t,i)=/^P*. 

As a consequence, for each time t, the minimum cumulative metric of the 
paths with complementary bit to the estimated sequence has only to be determined. 

In a second solution, not only the best paths for both values 0 and 1 for t^ 
estimated bit are taken into account, but all the metrics in the paths stored by the stack 
5 algorithm. So, the log-likelihood ratio A (x [t]) is approximated by 

A(x[t]) = log( S e»^^i/2 e^^^i) 
l<i<r l<i<r 
Tpi>=t Tpi>=t 
Pi[t]=l JCpi[t]=0 

to 

Figures 9 and 10 show performance of the method according to the invention 
compared with other methods for source decoding of variable-length sequences. 

• The following method considered : Hard, which is the traditional hard- 
15 input VLC algorithm, without any a priori knowledge, 

• KMAP, i.e. the MAP algorithm which consists in a Maximxmi A 
Posteriori decoding of VLC codes with exact a priori knowledge on the number of symbols 
by fi^ame. 

• SOSAl denotes the Soft Output Stack Algorithm using the first 
30 solution to generate soft output values, 

• SOSA2 denotes the Soft Output Stack Algorithm using the second 
solution to generate soft output values. 

The following codes are considered in the communication chain: VLC code C 
35 given in Table 1 followed by convolutional code CCa with coded bits punctured using the 
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puncturing table defined in Table 2. The results are given in terms of Frame Error Rate (PER) 
and Signal to Noise Ratio (SNR). 





Prob. 


C 


Si 


0.33 


00 


Sz 


0.30 


11 


S3 


0.18 


010 


S4 


0.10 


101 


Ss 


0.09 


Olio 


Average length 


2.47 



5 Table 1 : Short VLC code used in the simulations. 

111111111 
110 110 111 

.0 Table 2: Puncturing table used in the simulations: puncturing rate R=8/9. 

Figure 9 first illustrates the performance of both SOSAl and S0SA2 
algorithms for the same chain at the output of the VLC decoder 44 for various iterations. The 
size of the stack is set to 20. The two different solutions proposed are roughly equivalent, 
with perhaps a very small advantage to SOSAl. 

15 Figure 10 shows a performance comparison between the KMAP and the 

SOSA2 algorithms for the same chain at the output of the VLC decoder 44 for various 
iterations. As foreseen by previous studies on hard outputs versions of these algorithms in 
particular [L. Perros-Meilhac and C. Lamy. "Huffinann tree based metric derivation for a 
low-complexity sequential soft VLC decoding". In Proceedings of ICC'02, volume 2, pages 

20 783-787, New York, USA, April-May 2002], both perform similarly for the first iteration. 
However, the gain of about 0.8 dB obtained with KMAP algorithm for FER=10"^ after 4 
iterations is not completely reached wifli S0SA2 algorithm, which only provides about 0.3 
dB of gain. 

The stack algorithm in only about 0.5 dB worse than KMAP algorithm for a 
25 much lower complexity. 



wo 2004/025840 PCT/IB2003/003870 

13 

Yet, comparing these results with wlat would be obtained when replacing the 
SISO VLC decoders in the receiver by a classical hard VLC decoder (hence without 
iterating), the gain provided by the iteration process was found as being substantial. 



